Markov decision process

Results: 537



#Item
101Playing Atari with Deep Reinforcement Learning  Volodymyr Mnih Koray Kavukcuoglu

Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu

Add to Reading List

Source URL: www.cs.toronto.edu

Language: English - Date: 2013-12-19 11:19:32
102A subexponential lower bound for the Least Recently Considered rule for solving linear programs and games Oliver Friedmann Department of Computer Science, University of Munich, Germany

A subexponential lower bound for the Least Recently Considered rule for solving linear programs and games Oliver Friedmann Department of Computer Science, University of Munich, Germany

Add to Reading List

Source URL: files.oliverfriedmann.de

Language: English - Date: 2012-03-20 09:22:12
103This paper was presented as part of the main technical program at IEEE INFOCOMA Nearly-Optimal Index Rule for Scheduling of Users with Abandonment Urtzi Ayesta∗† , Peter Jacko∗ and Vladimir Novak∗‡ ∗ B

This paper was presented as part of the main technical program at IEEE INFOCOMA Nearly-Optimal Index Rule for Scheduling of Users with Abandonment Urtzi Ayesta∗† , Peter Jacko∗ and Vladimir Novak∗‡ ∗ B

Add to Reading List

Source URL: homepages.laas.fr

Language: English - Date: 2012-01-01 05:04:19
104Subexponential lower bounds for randomized pivoting rules for solving linear programs Oliver Friedmann ∗

Subexponential lower bounds for randomized pivoting rules for solving linear programs Oliver Friedmann ∗

Add to Reading List

Source URL: files.oliverfriedmann.de

Language: English - Date: 2012-02-10 07:43:14
105Performance Evaluation Performance Evaluation–22 A Modeling Framework for Optimizing the Flow-Level Scheduling with Time-Varying Channels

Performance Evaluation Performance Evaluation–22 A Modeling Framework for Optimizing the Flow-Level Scheduling with Time-Varying Channels

Add to Reading List

Source URL: homepages.laas.fr

Language: English - Date: 2012-01-01 05:03:03
106Department of Mathematics Texas A&M University 3368 TAMU College Station, TXF

Department of Mathematics Texas A&M University 3368 TAMU College Station, TXF

Add to Reading List

Source URL: see-math.math.tamu.edu

Language: English - Date: 2015-02-14 23:22:02
107A DISSERTATION IN ARTIFICIAL INTELLIGENCE  Evolutionary Dynamics of Reinforcement Learning Algorithms in Strategic Interactions

A DISSERTATION IN ARTIFICIAL INTELLIGENCE Evolutionary Dynamics of Reinforcement Learning Algorithms in Strategic Interactions

Add to Reading List

Source URL: michaelkaisers.com

Language: English - Date: 2012-12-03 03:18:47
108Submodular Surrogates for Value of Information Yuxin Chen Shervin Javdani  Amin Karbasi

Submodular Surrogates for Value of Information Yuxin Chen Shervin Javdani Amin Karbasi

Add to Reading List

Source URL: www.ri.cmu.edu

Language: English - Date: 2015-01-12 10:51:27
109A subexponential lower bound for Zadeh’s pivoting rule for solving linear programs and games Oliver Friedmann Department of Computer Science, University of Munich,

A subexponential lower bound for Zadeh’s pivoting rule for solving linear programs and games Oliver Friedmann Department of Computer Science, University of Munich,

Add to Reading List

Source URL: files.oliverfriedmann.de

Language: English - Date: 2012-02-10 07:43:23
110Balancing Anarchy and Central Control Individual vs. Joint Action Reinforcement Learning Daniel Claes June 18, 2010  Abstract

Balancing Anarchy and Central Control Individual vs. Joint Action Reinforcement Learning Daniel Claes June 18, 2010 Abstract

Add to Reading List

Source URL: michaelkaisers.com

Language: English - Date: 2012-04-29 08:03:28